Overview
Brought to you by YData
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.3 MiB |
| Average record size in memory | 552.4 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 11 |
| Categorical | 7 |
blood_pressure is highly overall correlated with bp_hr_interaction and 1 other fields | High correlation |
bp_glucose_ratio is highly overall correlated with glucose_level | High correlation |
bp_hr_interaction is highly overall correlated with blood_pressure and 3 other fields | High correlation |
cholesterol_level is highly overall correlated with pca2 | High correlation |
duration_per_hr is highly overall correlated with pca1 and 1 other fields | High correlation |
glucose_level is highly overall correlated with bp_glucose_ratio | High correlation |
heart_rate is highly overall correlated with bp_hr_interaction | High correlation |
pca1 is highly overall correlated with blood_pressure and 2 other fields | High correlation |
pca2 is highly overall correlated with bp_hr_interaction and 1 other fields | High correlation |
symptom_duration is highly overall correlated with duration_per_hr | High correlation |
patient_id has unique values | Unique |
pca1 has unique values | Unique |
pca2 has unique values | Unique |
age has 108 (1.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-14 16:52:38.488989 |
|---|---|
| Analysis finished | 2025-04-14 16:52:48.044133 |
| Duration | 9.56 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Variables
patient_id
Text
Unique 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 634.9 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 10000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | PAT00000 |
|---|---|
| 2nd row | PAT00001 |
| 3rd row | PAT00002 |
| 4th row | PAT00003 |
| 5th row | PAT00004 |
| Value | Count | Frequency (%) |
| pat00000 | 1 | < 0.1% |
| pat00008 | 1 | < 0.1% |
| pat00017 | 1 | < 0.1% |
| pat00002 | 1 | < 0.1% |
| pat00003 | 1 | < 0.1% |
| pat00004 | 1 | < 0.1% |
| pat00005 | 1 | < 0.1% |
| pat00006 | 1 | < 0.1% |
| pat00007 | 1 | < 0.1% |
| pat00009 | 1 | < 0.1% |
| Other values (9990) | 9990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 14000 | |
| P | 10000 | |
| A | 10000 | |
| T | 10000 | |
| 6 | 4000 | 5.0% |
| 7 | 4000 | 5.0% |
| 3 | 4000 | 5.0% |
| 4 | 4000 | 5.0% |
| 5 | 4000 | 5.0% |
| 8 | 4000 | 5.0% |
| Other values (3) | 12000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 80000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14000 | |
| P | 10000 | |
| A | 10000 | |
| T | 10000 | |
| 6 | 4000 | 5.0% |
| 7 | 4000 | 5.0% |
| 3 | 4000 | 5.0% |
| 4 | 4000 | 5.0% |
| 5 | 4000 | 5.0% |
| 8 | 4000 | 5.0% |
| Other values (3) | 12000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 80000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14000 | |
| P | 10000 | |
| A | 10000 | |
| T | 10000 | |
| 6 | 4000 | 5.0% |
| 7 | 4000 | 5.0% |
| 3 | 4000 | 5.0% |
| 4 | 4000 | 5.0% |
| 5 | 4000 | 5.0% |
| 8 | 4000 | 5.0% |
| Other values (3) | 12000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 80000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 14000 | |
| P | 10000 | |
| A | 10000 | |
| T | 10000 | |
| 6 | 4000 | 5.0% |
| 7 | 4000 | 5.0% |
| 3 | 4000 | 5.0% |
| 4 | 4000 | 5.0% |
| 5 | 4000 | 5.0% |
| 8 | 4000 | 5.0% |
| Other values (3) | 12000 |
age
Real number (ℝ)
Zeros 
| Distinct | 90 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.4528 |
| Minimum | 0 |
|---|---|
| Maximum | 89 |
| Zeros | 108 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 22 |
| median | 44 |
| Q3 | 67 |
| 95-th percentile | 85 |
| Maximum | 89 |
| Range | 89 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 25.9518 |
|---|---|
| Coefficient of variation (CV) | 0.58380574 |
| Kurtosis | -1.2028159 |
| Mean | 44.4528 |
| Median Absolute Deviation (MAD) | 22.5 |
| Skewness | 0.014441554 |
| Sum | 444528 |
| Variance | 673.49592 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 139 | 1.4% |
| 61 | 129 | 1.3% |
| 57 | 129 | 1.3% |
| 12 | 128 | 1.3% |
| 24 | 128 | 1.3% |
| 86 | 128 | 1.3% |
| 53 | 126 | 1.3% |
| 25 | 126 | 1.3% |
| 81 | 125 | 1.2% |
| 20 | 125 | 1.2% |
| Other values (80) | 8717 |
| Value | Count | Frequency (%) |
| 0 | 108 | |
| 1 | 117 | |
| 2 | 107 | |
| 3 | 108 | |
| 4 | 95 | |
| 5 | 106 | |
| 6 | 88 | |
| 7 | 113 | |
| 8 | 111 | |
| 9 | 113 |
| Value | Count | Frequency (%) |
| 89 | 124 | |
| 88 | 101 | |
| 87 | 111 | |
| 86 | 128 | |
| 85 | 118 | |
| 84 | 104 | |
| 83 | 111 | |
| 82 | 94 | |
| 81 | 125 | |
| 80 | 104 |
gender
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 605.7 KiB |
| Female | |
|---|---|
| Male | |
| Other | 193 |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.0157 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Male |
| 3rd row | Male |
| 4th row | Female |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Female | 4982 | |
| Male | 4825 | |
| Other | 193 | 1.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 4982 | |
| male | 4825 | |
| other | 193 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14982 | |
| a | 9807 | |
| l | 9807 | |
| F | 4982 | 9.9% |
| m | 4982 | 9.9% |
| M | 4825 | 9.6% |
| O | 193 | 0.4% |
| t | 193 | 0.4% |
| h | 193 | 0.4% |
| r | 193 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 50157 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 14982 | |
| a | 9807 | |
| l | 9807 | |
| F | 4982 | 9.9% |
| m | 4982 | 9.9% |
| M | 4825 | 9.6% |
| O | 193 | 0.4% |
| t | 193 | 0.4% |
| h | 193 | 0.4% |
| r | 193 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 50157 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 14982 | |
| a | 9807 | |
| l | 9807 | |
| F | 4982 | 9.9% |
| m | 4982 | 9.9% |
| M | 4825 | 9.6% |
| O | 193 | 0.4% |
| t | 193 | 0.4% |
| h | 193 | 0.4% |
| r | 193 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 50157 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 14982 | |
| a | 9807 | |
| l | 9807 | |
| F | 4982 | 9.9% |
| m | 4982 | 9.9% |
| M | 4825 | 9.6% |
| O | 193 | 0.4% |
| t | 193 | 0.4% |
| h | 193 | 0.4% |
| r | 193 | 0.4% |
blood_pressure
Real number (ℝ)
High correlation 
| Distinct | 817 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120.13804 |
| Minimum | 62.2 |
|---|---|
| Maximum | 187.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 62.2 |
|---|---|
| 5-th percentile | 95.7 |
| Q1 | 109.8 |
| median | 120.2 |
| Q3 | 130.5 |
| 95-th percentile | 145.1 |
| Maximum | 187.2 |
| Range | 125 |
| Interquartile range (IQR) | 20.7 |
Descriptive statistics
| Standard deviation | 15.091376 |
|---|---|
| Coefficient of variation (CV) | 0.12561697 |
| Kurtosis | -0.0085879674 |
| Mean | 120.13804 |
| Median Absolute Deviation (MAD) | 10.3 |
| Skewness | -0.011185183 |
| Sum | 1201380.4 |
| Variance | 227.74964 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 120.3 | 39 | 0.4% |
| 122.1 | 37 | 0.4% |
| 123.8 | 36 | 0.4% |
| 114.7 | 36 | 0.4% |
| 118.8 | 35 | 0.4% |
| 122.2 | 35 | 0.4% |
| 117.8 | 35 | 0.4% |
| 120.6 | 34 | 0.3% |
| 115.1 | 34 | 0.3% |
| 121.7 | 34 | 0.3% |
| Other values (807) | 9645 |
| Value | Count | Frequency (%) |
| 62.2 | 1 | |
| 65.2 | 1 | |
| 65.5 | 1 | |
| 70 | 2 | |
| 70.1 | 1 | |
| 70.4 | 1 | |
| 70.6 | 1 | |
| 71.2 | 1 | |
| 71.8 | 1 | |
| 72 | 1 |
| Value | Count | Frequency (%) |
| 187.2 | 1 | |
| 179.1 | 1 | |
| 174.2 | 1 | |
| 174 | 1 | |
| 169.3 | 1 | |
| 167.1 | 2 | |
| 166.5 | 1 | |
| 166.1 | 1 | |
| 165 | 2 | |
| 164.7 | 1 |
heart_rate
Real number (ℝ)
High correlation 
| Distinct | 573 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 74.92768 |
| Minimum | 30.3 |
|---|---|
| Maximum | 112.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 30.3 |
|---|---|
| 5-th percentile | 58.795 |
| Q1 | 68.1 |
| median | 75 |
| Q3 | 81.7 |
| 95-th percentile | 91.4 |
| Maximum | 112.3 |
| Range | 82 |
| Interquartile range (IQR) | 13.6 |
Descriptive statistics
| Standard deviation | 9.9705304 |
|---|---|
| Coefficient of variation (CV) | 0.13306872 |
| Kurtosis | 0.010943293 |
| Mean | 74.92768 |
| Median Absolute Deviation (MAD) | 6.8 |
| Skewness | 0.012427005 |
| Sum | 749276.8 |
| Variance | 99.411477 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 69.7 | 54 | 0.5% |
| 76.8 | 52 | 0.5% |
| 76.5 | 52 | 0.5% |
| 75.9 | 50 | 0.5% |
| 73.7 | 50 | 0.5% |
| 68.9 | 49 | 0.5% |
| 71.4 | 49 | 0.5% |
| 71.9 | 48 | 0.5% |
| 75.2 | 48 | 0.5% |
| 74.4 | 48 | 0.5% |
| Other values (563) | 9500 |
| Value | Count | Frequency (%) |
| 30.3 | 1 | |
| 39.7 | 1 | |
| 40 | 1 | |
| 40.5 | 1 | |
| 40.7 | 1 | |
| 41.3 | 1 | |
| 41.6 | 1 | |
| 43 | 1 | |
| 43.4 | 1 | |
| 44.1 | 1 |
| Value | Count | Frequency (%) |
| 112.3 | 1 | |
| 111.9 | 1 | |
| 111 | 1 | |
| 108.5 | 2 | |
| 107.6 | 1 | |
| 107.2 | 1 | |
| 106.6 | 2 | |
| 106.3 | 1 | |
| 105.4 | 1 | |
| 105.1 | 2 |
glucose_level
Real number (ℝ)
High correlation 
| Distinct | 1278 |
|---|---|
| Distinct (%) | 12.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99.97652 |
| Minimum | 9.2 |
|---|---|
| Maximum | 188.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 9.2 |
|---|---|
| 5-th percentile | 58.895 |
| Q1 | 82.775 |
| median | 100 |
| Q3 | 117 |
| 95-th percentile | 140.9 |
| Maximum | 188.4 |
| Range | 179.2 |
| Interquartile range (IQR) | 34.225 |
Descriptive statistics
| Standard deviation | 25.04059 |
|---|---|
| Coefficient of variation (CV) | 0.25046471 |
| Kurtosis | -0.069181733 |
| Mean | 99.97652 |
| Median Absolute Deviation (MAD) | 17.1 |
| Skewness | -0.015863818 |
| Sum | 999765.2 |
| Variance | 627.03115 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 96.7 | 27 | 0.3% |
| 99.5 | 26 | 0.3% |
| 106 | 25 | 0.2% |
| 97.5 | 25 | 0.2% |
| 93.4 | 24 | 0.2% |
| 103.1 | 24 | 0.2% |
| 109.6 | 23 | 0.2% |
| 93.3 | 23 | 0.2% |
| 103.4 | 23 | 0.2% |
| 112.7 | 23 | 0.2% |
| Other values (1268) | 9757 |
| Value | Count | Frequency (%) |
| 9.2 | 1 | |
| 13.8 | 1 | |
| 14.1 | 1 | |
| 14.3 | 1 | |
| 15.2 | 1 | |
| 18.7 | 1 | |
| 19.5 | 1 | |
| 19.7 | 1 | |
| 20 | 1 | |
| 20.5 | 1 |
| Value | Count | Frequency (%) |
| 188.4 | 1 | |
| 183.4 | 1 | |
| 182.4 | 1 | |
| 182 | 1 | |
| 179.8 | 1 | |
| 178.5 | 1 | |
| 178.3 | 1 | |
| 177.6 | 1 | |
| 175.5 | 1 | |
| 175.2 | 1 |
cholesterol_level
Real number (ℝ)
High correlation 
| Distinct | 1865 |
|---|---|
| Distinct (%) | 18.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 180.04955 |
| Minimum | 8.2 |
|---|---|
| Maximum | 329.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 8.2 |
|---|---|
| 5-th percentile | 113.8 |
| Q1 | 153.4 |
| median | 180.3 |
| Q3 | 206.8 |
| 95-th percentile | 245.6 |
| Maximum | 329.8 |
| Range | 321.6 |
| Interquartile range (IQR) | 53.4 |
Descriptive statistics
| Standard deviation | 39.839699 |
|---|---|
| Coefficient of variation (CV) | 0.22127075 |
| Kurtosis | -0.003167008 |
| Mean | 180.04955 |
| Median Absolute Deviation (MAD) | 26.7 |
| Skewness | -0.048719807 |
| Sum | 1800495.5 |
| Variance | 1587.2016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 158.8 | 21 | 0.2% |
| 201.8 | 18 | 0.2% |
| 152.3 | 18 | 0.2% |
| 191.7 | 18 | 0.2% |
| 187 | 18 | 0.2% |
| 185.1 | 17 | 0.2% |
| 190.4 | 16 | 0.2% |
| 198.7 | 16 | 0.2% |
| 164.3 | 16 | 0.2% |
| 178.1 | 16 | 0.2% |
| Other values (1855) | 9826 |
| Value | Count | Frequency (%) |
| 8.2 | 1 | |
| 43.1 | 1 | |
| 47.3 | 1 | |
| 48.5 | 1 | |
| 49.4 | 1 | |
| 50.1 | 1 | |
| 50.5 | 1 | |
| 51 | 1 | |
| 52 | 1 | |
| 52.6 | 1 |
| Value | Count | Frequency (%) |
| 329.8 | 1 | |
| 316.1 | 1 | |
| 315.2 | 1 | |
| 308.9 | 1 | |
| 307.3 | 1 | |
| 302.6 | 1 | |
| 300 | 1 | |
| 299.4 | 1 | |
| 298.8 | 1 | |
| 298.2 | 1 |
symptom_duration
Real number (ℝ)
High correlation 
| Distinct | 29 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.8767 |
| Minimum | 1 |
|---|---|
| Maximum | 29 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 15 |
| Q3 | 22 |
| 95-th percentile | 28 |
| Maximum | 29 |
| Range | 28 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 8.4420628 |
|---|---|
| Coefficient of variation (CV) | 0.56746878 |
| Kurtosis | -1.2120203 |
| Mean | 14.8767 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.015446318 |
| Sum | 148767 |
| Variance | 71.268424 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 404 | 4.0% |
| 23 | 372 | 3.7% |
| 27 | 370 | 3.7% |
| 13 | 368 | 3.7% |
| 7 | 368 | 3.7% |
| 29 | 360 | 3.6% |
| 2 | 357 | 3.6% |
| 11 | 354 | 3.5% |
| 16 | 354 | 3.5% |
| 12 | 351 | 3.5% |
| Other values (19) | 6342 |
| Value | Count | Frequency (%) |
| 1 | 404 | |
| 2 | 357 | |
| 3 | 346 | |
| 4 | 337 | |
| 5 | 340 | |
| 6 | 341 | |
| 7 | 368 | |
| 8 | 343 | |
| 9 | 340 | |
| 10 | 323 |
| Value | Count | Frequency (%) |
| 29 | 360 | |
| 28 | 322 | |
| 27 | 370 | |
| 26 | 334 | |
| 25 | 337 | |
| 24 | 340 | |
| 23 | 372 | |
| 22 | 335 | |
| 21 | 324 | |
| 20 | 321 |
clinical_notes
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 717.3 KiB |
| fever and fatigue | |
|---|---|
| blurred vision | |
| joint pain | |
| chest pain | |
| dizziness and confusion | |
| Other values (3) |
Length
| Max length | 23 |
|---|---|
| Median length | 19 |
| Mean length | 16.4387 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | headache and nausea |
|---|---|
| 2nd row | abdominal discomfort |
| 3rd row | fever and fatigue |
| 4th row | joint pain |
| 5th row | chest pain |
Common Values
| Value | Count | Frequency (%) |
| fever and fatigue | 1289 | |
| blurred vision | 1279 | |
| joint pain | 1269 | |
| chest pain | 1266 | |
| dizziness and confusion | 1236 | |
| abdominal discomfort | 1231 | |
| shortness of breath | 1223 | |
| headache and nausea | 1207 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| and | 3732 | |
| pain | 2535 | 10.2% |
| fever | 1289 | 5.2% |
| fatigue | 1289 | 5.2% |
| blurred | 1279 | 5.1% |
| vision | 1279 | 5.1% |
| joint | 1269 | 5.1% |
| chest | 1266 | 5.1% |
| confusion | 1236 | 5.0% |
| dizziness | 1236 | 5.0% |
| Other values (7) | 8545 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 16184 | 9.8% |
| a | 16069 | 9.8% |
| 14955 | 9.1% | |
| i | 13821 | 8.4% |
| e | 13715 | 8.3% |
| s | 12360 | 7.5% |
| o | 11159 | 6.8% |
| d | 9916 | 6.0% |
| r | 7524 | 4.6% |
| t | 7501 | 4.6% |
| Other values (12) | 41183 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 164387 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 16184 | 9.8% |
| a | 16069 | 9.8% |
| 14955 | 9.1% | |
| i | 13821 | 8.4% |
| e | 13715 | 8.3% |
| s | 12360 | 7.5% |
| o | 11159 | 6.8% |
| d | 9916 | 6.0% |
| r | 7524 | 4.6% |
| t | 7501 | 4.6% |
| Other values (12) | 41183 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 164387 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 16184 | 9.8% |
| a | 16069 | 9.8% |
| 14955 | 9.1% | |
| i | 13821 | 8.4% |
| e | 13715 | 8.3% |
| s | 12360 | 7.5% |
| o | 11159 | 6.8% |
| d | 9916 | 6.0% |
| r | 7524 | 4.6% |
| t | 7501 | 4.6% |
| Other values (12) | 41183 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 164387 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 16184 | 9.8% |
| a | 16069 | 9.8% |
| 14955 | 9.1% | |
| i | 13821 | 8.4% |
| e | 13715 | 8.3% |
| s | 12360 | 7.5% |
| o | 11159 | 6.8% |
| d | 9916 | 6.0% |
| r | 7524 | 4.6% |
| t | 7501 | 4.6% |
| Other values (12) | 41183 |
department
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 664.5 KiB |
| General Medicine | |
|---|---|
| Orthopedics | |
| Cardiology | |
| Neurology | |
| Emergency |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 11.0332 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Orthopedics |
|---|---|
| 2nd row | Orthopedics |
| 3rd row | General Medicine |
| 4th row | Cardiology |
| 5th row | Orthopedics |
Common Values
| Value | Count | Frequency (%) |
| General Medicine | 2038 | |
| Orthopedics | 2033 | |
| Cardiology | 2000 | |
| Neurology | 1993 | |
| Emergency | 1936 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| general | 2038 | |
| medicine | 2038 | |
| orthopedics | 2033 | |
| cardiology | 2000 | |
| neurology | 1993 | |
| emergency | 1936 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 16050 | |
| o | 10019 | 9.1% |
| r | 10000 | 9.1% |
| i | 8109 | 7.3% |
| d | 6071 | 5.5% |
| l | 6031 | 5.5% |
| n | 6012 | 5.4% |
| c | 6007 | 5.4% |
| y | 5929 | 5.4% |
| g | 5929 | 5.4% |
| Other values (14) | 30175 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 110332 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 16050 | |
| o | 10019 | 9.1% |
| r | 10000 | 9.1% |
| i | 8109 | 7.3% |
| d | 6071 | 5.5% |
| l | 6031 | 5.5% |
| n | 6012 | 5.4% |
| c | 6007 | 5.4% |
| y | 5929 | 5.4% |
| g | 5929 | 5.4% |
| Other values (14) | 30175 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 110332 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 16050 | |
| o | 10019 | 9.1% |
| r | 10000 | 9.1% |
| i | 8109 | 7.3% |
| d | 6071 | 5.5% |
| l | 6031 | 5.5% |
| n | 6012 | 5.4% |
| c | 6007 | 5.4% |
| y | 5929 | 5.4% |
| g | 5929 | 5.4% |
| Other values (14) | 30175 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 110332 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 16050 | |
| o | 10019 | 9.1% |
| r | 10000 | 9.1% |
| i | 8109 | 7.3% |
| d | 6071 | 5.5% |
| l | 6031 | 5.5% |
| n | 6012 | 5.4% |
| c | 6007 | 5.4% |
| y | 5929 | 5.4% |
| g | 5929 | 5.4% |
| Other values (14) | 30175 |
admission_type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 650.6 KiB |
| Outpatient | |
|---|---|
| Inpatient | |
| Emergency |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.6039 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Inpatient |
|---|---|
| 2nd row | Emergency |
| 3rd row | Outpatient |
| 4th row | Outpatient |
| 5th row | Inpatient |
Common Values
| Value | Count | Frequency (%) |
| Outpatient | 6039 | |
| Inpatient | 2513 | |
| Emergency | 1448 | 14.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| outpatient | 6039 | |
| inpatient | 2513 | |
| emergency | 1448 | 14.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 23143 | |
| n | 12513 | |
| e | 11448 | |
| p | 8552 | 8.9% |
| a | 8552 | 8.9% |
| i | 8552 | 8.9% |
| O | 6039 | 6.3% |
| u | 6039 | 6.3% |
| I | 2513 | 2.6% |
| E | 1448 | 1.5% |
| Other values (5) | 7240 | 7.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 96039 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 23143 | |
| n | 12513 | |
| e | 11448 | |
| p | 8552 | 8.9% |
| a | 8552 | 8.9% |
| i | 8552 | 8.9% |
| O | 6039 | 6.3% |
| u | 6039 | 6.3% |
| I | 2513 | 2.6% |
| E | 1448 | 1.5% |
| Other values (5) | 7240 | 7.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 96039 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 23143 | |
| n | 12513 | |
| e | 11448 | |
| p | 8552 | 8.9% |
| a | 8552 | 8.9% |
| i | 8552 | 8.9% |
| O | 6039 | 6.3% |
| u | 6039 | 6.3% |
| I | 2513 | 2.6% |
| E | 1448 | 1.5% |
| Other values (5) | 7240 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 96039 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 23143 | |
| n | 12513 | |
| e | 11448 | |
| p | 8552 | 8.9% |
| a | 8552 | 8.9% |
| i | 8552 | 8.9% |
| O | 6039 | 6.3% |
| u | 6039 | 6.3% |
| I | 2513 | 2.6% |
| E | 1448 | 1.5% |
| Other values (5) | 7240 | 7.5% |
insurance_status
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 628.0 KiB |
| Insured | |
|---|---|
| Uninsured |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.2982 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Insured |
|---|---|
| 2nd row | Insured |
| 3rd row | Insured |
| 4th row | Insured |
| 5th row | Uninsured |
Common Values
| Value | Count | Frequency (%) |
| Insured | 8509 | |
| Uninsured | 1491 | 14.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| insured | 8509 | |
| uninsured | 1491 | 14.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 11491 | |
| s | 10000 | |
| u | 10000 | |
| r | 10000 | |
| e | 10000 | |
| d | 10000 | |
| I | 8509 | |
| U | 1491 | 2.0% |
| i | 1491 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 72982 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 11491 | |
| s | 10000 | |
| u | 10000 | |
| r | 10000 | |
| e | 10000 | |
| d | 10000 | |
| I | 8509 | |
| U | 1491 | 2.0% |
| i | 1491 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 72982 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 11491 | |
| s | 10000 | |
| u | 10000 | |
| r | 10000 | |
| e | 10000 | |
| d | 10000 | |
| I | 8509 | |
| U | 1491 | 2.0% |
| i | 1491 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 72982 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 11491 | |
| s | 10000 | |
| u | 10000 | |
| r | 10000 | |
| e | 10000 | |
| d | 10000 | |
| I | 8509 | |
| U | 1491 | 2.0% |
| i | 1491 | 2.0% |
diagnosis_code
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 595.8 KiB |
| D163 | |
|---|---|
| D141 | |
| D196 | |
| D185 | |
| D198 | |
| Other values (3) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | D141 |
|---|---|
| 2nd row | D196 |
| 3rd row | D165 |
| 4th row | D196 |
| 5th row | D196 |
Common Values
| Value | Count | Frequency (%) |
| D163 | 2510 | |
| D141 | 1939 | |
| D196 | 1537 | |
| D185 | 1079 | |
| D198 | 995 | 10.0% |
| D119 | 952 | 9.5% |
| D165 | 499 | 5.0% |
| D143 | 489 | 4.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| d163 | 2510 | |
| d141 | 1939 | |
| d196 | 1537 | |
| d185 | 1079 | |
| d198 | 995 | 10.0% |
| d119 | 952 | 9.5% |
| d165 | 499 | 5.0% |
| d143 | 489 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 12891 | |
| D | 10000 | |
| 6 | 4546 | 11.4% |
| 9 | 3484 | 8.7% |
| 3 | 2999 | 7.5% |
| 4 | 2428 | 6.1% |
| 8 | 2074 | 5.2% |
| 5 | 1578 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 40000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 12891 | |
| D | 10000 | |
| 6 | 4546 | 11.4% |
| 9 | 3484 | 8.7% |
| 3 | 2999 | 7.5% |
| 4 | 2428 | 6.1% |
| 8 | 2074 | 5.2% |
| 5 | 1578 | 3.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 40000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 12891 | |
| D | 10000 | |
| 6 | 4546 | 11.4% |
| 9 | 3484 | 8.7% |
| 3 | 2999 | 7.5% |
| 4 | 2428 | 6.1% |
| 8 | 2074 | 5.2% |
| 5 | 1578 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 40000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 12891 | |
| D | 10000 | |
| 6 | 4546 | 11.4% |
| 9 | 3484 | 8.7% |
| 3 | 2999 | 7.5% |
| 4 | 2428 | 6.1% |
| 8 | 2074 | 5.2% |
| 5 | 1578 | 3.9% |
bp_hr_interaction
Real number (ℝ)
High correlation 
| Distinct | 9609 |
|---|---|
| Distinct (%) | 96.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9005.202 |
| Minimum | 3408.75 |
|---|---|
| Maximum | 16848 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 3408.75 |
|---|---|
| 5-th percentile | 6404.9685 |
| Q1 | 7850.625 |
| median | 8920.7 |
| Q3 | 10077.062 |
| 95-th percentile | 11904.15 |
| Maximum | 16848 |
| Range | 13439.25 |
| Interquartile range (IQR) | 2226.4375 |
Descriptive statistics
| Standard deviation | 1673.1703 |
|---|---|
| Coefficient of variation (CV) | 0.18580042 |
| Kurtosis | 0.044800036 |
| Mean | 9005.202 |
| Median Absolute Deviation (MAD) | 1114.3 |
| Skewness | 0.26785306 |
| Sum | 90052020 |
| Variance | 2799498.8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8289.54 | 3 | < 0.1% |
| 8755.2 | 3 | < 0.1% |
| 8898.24 | 3 | < 0.1% |
| 8446.02 | 3 | < 0.1% |
| 8787.84 | 3 | < 0.1% |
| 8722.74 | 3 | < 0.1% |
| 8905.14 | 3 | < 0.1% |
| 8562.9 | 3 | < 0.1% |
| 8396.28 | 3 | < 0.1% |
| 9853.2 | 3 | < 0.1% |
| Other values (9599) | 9970 |
| Value | Count | Frequency (%) |
| 3408.75 | 1 | |
| 3452.68 | 1 | |
| 3628.8 | 1 | |
| 3996.27 | 1 | |
| 4107.2 | 1 | |
| 4371.68 | 1 | |
| 4388.8 | 1 | |
| 4390.2 | 1 | |
| 4411.88 | 1 | |
| 4413.2 | 1 |
| Value | Count | Frequency (%) |
| 16848 | 1 | |
| 16212.14 | 1 | |
| 15665.85 | 1 | |
| 15651 | 1 | |
| 15616.71 | 1 | |
| 15585.4 | 1 | |
| 15470.72 | 1 | |
| 14983.65 | 1 | |
| 14733.16 | 1 | |
| 14717.24 | 1 |
bp_glucose_ratio
Real number (ℝ)
High correlation 
| Distinct | 9852 |
|---|---|
| Distinct (%) | 98.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2846013 |
| Minimum | 0.47418398 |
|---|---|
| Maximum | 13.166667 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0.47418398 |
|---|---|
| 5-th percentile | 0.79296594 |
| Q1 | 0.99437991 |
| median | 1.1881492 |
| Q3 | 1.4541427 |
| 95-th percentile | 2.0572817 |
| Maximum | 13.166667 |
| Range | 12.692483 |
| Interquartile range (IQR) | 0.4597628 |
Descriptive statistics
| Standard deviation | 0.48651987 |
|---|---|
| Coefficient of variation (CV) | 0.37873221 |
| Kurtosis | 67.678418 |
| Mean | 1.2846013 |
| Median Absolute Deviation (MAD) | 0.22139687 |
| Skewness | 4.9305291 |
| Sum | 12846.013 |
| Variance | 0.23670159 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 10 | 0.1% |
| 1.25 | 5 | 0.1% |
| 1.230769231 | 3 | < 0.1% |
| 1.264893617 | 3 | < 0.1% |
| 1.6 | 3 | < 0.1% |
| 1.666666667 | 3 | < 0.1% |
| 1.111111111 | 3 | < 0.1% |
| 0.9959718026 | 2 | < 0.1% |
| 1.042087542 | 2 | < 0.1% |
| 1.523809524 | 2 | < 0.1% |
| Other values (9842) | 9964 |
| Value | Count | Frequency (%) |
| 0.4741839763 | 1 | |
| 0.4814241486 | 1 | |
| 0.4857142857 | 1 | |
| 0.4944912508 | 1 | |
| 0.5104844541 | 1 | |
| 0.5339339339 | 1 | |
| 0.5390869293 | 1 | |
| 0.5414398064 | 1 | |
| 0.5513833992 | 1 | |
| 0.55532926 | 1 |
| Value | Count | Frequency (%) |
| 13.16666667 | 1 | |
| 9.483443709 | 1 | |
| 8.477124183 | 1 | |
| 8.398648649 | 1 | |
| 8.296296296 | 1 | |
| 6.671428571 | 1 | |
| 6.091370558 | 1 | |
| 5.900763359 | 1 | |
| 5.873170732 | 1 | |
| 5.768115942 | 1 |
duration_per_hr
Real number (ℝ)
High correlation 
| Distinct | 6065 |
|---|---|
| Distinct (%) | 60.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.19942696 |
| Minimum | 0.0091324201 |
|---|---|
| Maximum | 0.68292683 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | 0.0091324201 |
|---|---|
| 5-th percentile | 0.024600246 |
| Q1 | 0.099418584 |
| median | 0.19449044 |
| Q3 | 0.29285299 |
| 95-th percentile | 0.39187228 |
| Maximum | 0.68292683 |
| Range | 0.67379441 |
| Interquartile range (IQR) | 0.19343441 |
Descriptive statistics
| Standard deviation | 0.11734438 |
|---|---|
| Coefficient of variation (CV) | 0.58840782 |
| Kurtosis | -0.85571272 |
| Mean | 0.19942696 |
| Median Absolute Deviation (MAD) | 0.096708006 |
| Skewness | 0.20958758 |
| Sum | 1994.2696 |
| Variance | 0.013769705 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3125 | 15 | 0.1% |
| 0.3333333333 | 14 | 0.1% |
| 0.1666666667 | 11 | 0.1% |
| 0.2857142857 | 11 | 0.1% |
| 0.1111111111 | 10 | 0.1% |
| 0.1818181818 | 10 | 0.1% |
| 0.1538461538 | 10 | 0.1% |
| 0.2631578947 | 10 | 0.1% |
| 0.1515151515 | 10 | 0.1% |
| 0.4166666667 | 9 | 0.1% |
| Other values (6055) | 9890 |
| Value | Count | Frequency (%) |
| 0.009132420091 | 1 | |
| 0.009871668312 | 1 | |
| 0.009881422925 | 1 | |
| 0.009950248756 | 1 | |
| 0.01012145749 | 1 | |
| 0.01014198783 | 1 | |
| 0.01020408163 | 2 | |
| 0.01028806584 | 1 | |
| 0.01037344398 | 1 | |
| 0.01043841336 | 1 |
| Value | Count | Frequency (%) |
| 0.6829268293 | 1 | |
| 0.5673758865 | 1 | |
| 0.5652173913 | 1 | |
| 0.5633802817 | 1 | |
| 0.56 | 1 | |
| 0.5523809524 | 1 | |
| 0.545112782 | 1 | |
| 0.5446623094 | 1 | |
| 0.5380333952 | 1 | |
| 0.5303030303 | 1 |
pca1
Real number (ℝ)
High correlation  Unique 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -1.4210855 × 10-18 |
| Minimum | -3.8564275 |
|---|---|
| Maximum | 3.5560304 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4991 |
| Negative (%) | 49.9% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -3.8564275 |
|---|---|
| 5-th percentile | -1.6712525 |
| Q1 | -0.68633492 |
| median | 0.0016482659 |
| Q3 | 0.68907255 |
| 95-th percentile | 1.6817608 |
| Maximum | 3.5560304 |
| Range | 7.4124579 |
| Interquartile range (IQR) | 1.3754075 |
Descriptive statistics
| Standard deviation | 1.0155934 |
|---|---|
| Coefficient of variation (CV) | -7.146603 × 1017 |
| Kurtosis | -0.054121309 |
| Mean | -1.4210855 × 10-18 |
| Median Absolute Deviation (MAD) | 0.68773505 |
| Skewness | -0.010293349 |
| Sum | 1.7941204 × 10-13 |
| Variance | 1.0314299 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6080990221 | 1 | < 0.1% |
| 2.512214335 | 1 | < 0.1% |
| -0.4051359761 | 1 | < 0.1% |
| -2.968708563 | 1 | < 0.1% |
| 0.3872240945 | 1 | < 0.1% |
| 0.6454981956 | 1 | < 0.1% |
| -1.011126221 | 1 | < 0.1% |
| 0.1357537134 | 1 | < 0.1% |
| -0.2662557181 | 1 | < 0.1% |
| -1.082001411 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| -3.856427539 | 1 | |
| -3.833893972 | 1 | |
| -3.785308854 | 1 | |
| -3.77699147 | 1 | |
| -3.354287856 | 1 | |
| -3.344189025 | 1 | |
| -3.243180826 | 1 | |
| -3.219826576 | 1 | |
| -3.131529852 | 1 | |
| -3.115082028 | 1 |
| Value | Count | Frequency (%) |
| 3.556030362 | 1 | |
| 3.538797233 | 1 | |
| 3.18578794 | 1 | |
| 3.175855747 | 1 | |
| 3.168475046 | 1 | |
| 3.11398811 | 1 | |
| 3.100294964 | 1 | |
| 3.036329916 | 1 | |
| 3.033785069 | 1 | |
| 3.024745713 | 1 |
pca2
Real number (ℝ)
High correlation  Unique 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.4158453 × 10-17 |
| Minimum | -3.6818499 |
|---|---|
| Maximum | 3.4352066 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 5006 |
| Negative (%) | 50.1% |
| Memory size | 78.2 KiB |
Quantile statistics
| Minimum | -3.6818499 |
|---|---|
| 5-th percentile | -1.6696831 |
| Q1 | -0.69276688 |
| median | -0.0011759472 |
| Q3 | 0.69401125 |
| 95-th percentile | 1.6714215 |
| Maximum | 3.4352066 |
| Range | 7.1170565 |
| Interquartile range (IQR) | 1.3867781 |
Descriptive statistics
| Standard deviation | 1.0133297 |
|---|---|
| Coefficient of variation (CV) | 4.1945141 × 1016 |
| Kurtosis | -0.1007268 |
| Mean | 2.4158453 × 10-17 |
| Median Absolute Deviation (MAD) | 0.69416779 |
| Skewness | -0.026855712 |
| Sum | 2.7000624 × 10-13 |
| Variance | 1.0268371 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -0.8633528449 | 1 | < 0.1% |
| 0.3900941681 | 1 | < 0.1% |
| 0.2679862773 | 1 | < 0.1% |
| 0.5268908697 | 1 | < 0.1% |
| -2.006806542 | 1 | < 0.1% |
| 1.033108587 | 1 | < 0.1% |
| -0.8806317605 | 1 | < 0.1% |
| 0.9196153406 | 1 | < 0.1% |
| -0.5958343407 | 1 | < 0.1% |
| -1.640295455 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| -3.681849864 | 1 | |
| -3.535715733 | 1 | |
| -3.454774898 | 1 | |
| -3.379740781 | 1 | |
| -3.300899246 | 1 | |
| -3.294560814 | 1 | |
| -3.253245638 | 1 | |
| -3.227835549 | 1 | |
| -3.19527389 | 1 | |
| -3.176947553 | 1 |
| Value | Count | Frequency (%) |
| 3.435206644 | 1 | |
| 3.411808754 | 1 | |
| 3.377153945 | 1 | |
| 3.292105602 | 1 | |
| 3.275448621 | 1 | |
| 3.084011115 | 1 | |
| 3.043875728 | 1 | |
| 3.034376742 | 1 | |
| 2.991964088 | 1 | |
| 2.980582689 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 2 |
| 3rd row | 0 |
| 4th row | 3 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2520 | |
| 3 | 2513 | |
| 1 | 2499 | |
| 2 | 2468 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2520 | |
| 3 | 2513 | |
| 1 | 2499 | |
| 2 | 2468 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2520 | |
| 3 | 2513 | |
| 1 | 2499 | |
| 2 | 2468 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2520 | |
| 3 | 2513 | |
| 1 | 2499 | |
| 2 | 2468 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2520 | |
| 3 | 2513 | |
| 1 | 2499 | |
| 2 | 2468 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2520 | |
| 3 | 2513 | |
| 1 | 2499 | |
| 2 | 2468 |
Interactions
Correlations
| admission_type | age | blood_pressure | bp_glucose_ratio | bp_hr_interaction | cholesterol_level | clinical_notes | cluster | department | diagnosis_code | duration_per_hr | gender | glucose_level | heart_rate | insurance_status | pca1 | pca2 | symptom_duration | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| admission_type | 1.000 | 0.014 | 0.010 | 0.011 | 0.000 | 0.000 | 0.013 | 0.000 | 0.015 | 0.021 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| age | 0.014 | 1.000 | 0.004 | 0.013 | 0.006 | 0.016 | 0.000 | 0.461 | 0.016 | 0.018 | 0.002 | 0.000 | -0.012 | 0.007 | 0.006 | -0.259 | 0.384 | 0.003 |
| blood_pressure | 0.010 | 0.004 | 1.000 | 0.430 | 0.675 | -0.004 | 0.007 | 0.127 | 0.019 | 0.000 | -0.014 | 0.000 | 0.005 | 0.025 | 0.000 | 0.548 | 0.348 | -0.010 |
| bp_glucose_ratio | 0.011 | 0.013 | 0.430 | 1.000 | 0.289 | -0.007 | 0.013 | 0.043 | 0.000 | 0.000 | 0.000 | 0.000 | -0.881 | 0.005 | 0.000 | -0.049 | -0.016 | -0.000 |
| bp_hr_interaction | 0.000 | 0.006 | 0.675 | 0.289 | 1.000 | 0.003 | 0.000 | 0.197 | 0.008 | 0.000 | -0.151 | 0.024 | 0.012 | 0.722 | 0.000 | 0.658 | 0.597 | -0.004 |
| cholesterol_level | 0.000 | 0.016 | -0.004 | -0.007 | 0.003 | 1.000 | 0.004 | 0.241 | 0.000 | 0.000 | 0.019 | 0.007 | 0.008 | 0.003 | 0.019 | -0.378 | 0.522 | 0.020 |
| clinical_notes | 0.013 | 0.000 | 0.007 | 0.013 | 0.000 | 0.004 | 1.000 | 0.008 | 0.011 | 0.008 | 0.000 | 0.000 | 0.004 | 0.014 | 0.018 | 0.000 | 0.000 | 0.018 |
| cluster | 0.000 | 0.461 | 0.127 | 0.043 | 0.197 | 0.241 | 0.008 | 1.000 | 0.000 | 0.013 | 0.437 | 0.000 | 0.124 | 0.158 | 0.000 | 0.250 | 0.434 | 0.456 |
| department | 0.015 | 0.016 | 0.019 | 0.000 | 0.008 | 0.000 | 0.011 | 0.000 | 1.000 | 0.010 | 0.000 | 0.000 | 0.000 | 0.010 | 0.000 | 0.000 | 0.012 | 0.005 |
| diagnosis_code | 0.021 | 0.018 | 0.000 | 0.000 | 0.000 | 0.000 | 0.008 | 0.013 | 0.010 | 1.000 | 0.001 | 0.005 | 0.004 | 0.003 | 0.000 | 0.007 | 0.006 | 0.000 |
| duration_per_hr | 0.000 | 0.002 | -0.014 | 0.000 | -0.151 | 0.019 | 0.000 | 0.437 | 0.000 | 0.001 | 1.000 | 0.000 | -0.006 | -0.198 | 0.000 | -0.512 | 0.265 | 0.972 |
| gender | 0.000 | 0.000 | 0.000 | 0.000 | 0.024 | 0.007 | 0.000 | 0.000 | 0.000 | 0.005 | 0.000 | 1.000 | 0.000 | 0.005 | 0.000 | 0.011 | 0.000 | 0.000 |
| glucose_level | 0.000 | -0.012 | 0.005 | -0.881 | 0.012 | 0.008 | 0.004 | 0.124 | 0.000 | 0.004 | -0.006 | 0.000 | 1.000 | 0.007 | 0.013 | 0.329 | 0.194 | -0.004 |
| heart_rate | 0.000 | 0.007 | 0.025 | 0.005 | 0.722 | 0.003 | 0.014 | 0.158 | 0.010 | 0.003 | -0.198 | 0.005 | 0.007 | 1.000 | 0.000 | 0.395 | 0.494 | 0.003 |
| insurance_status | 0.000 | 0.006 | 0.000 | 0.000 | 0.000 | 0.019 | 0.018 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.013 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
| pca1 | 0.000 | -0.259 | 0.548 | -0.049 | 0.658 | -0.378 | 0.000 | 0.250 | 0.000 | 0.007 | -0.512 | 0.011 | 0.329 | 0.395 | 0.000 | 1.000 | -0.007 | -0.441 |
| pca2 | 0.000 | 0.384 | 0.348 | -0.016 | 0.597 | 0.522 | 0.000 | 0.434 | 0.012 | 0.006 | 0.265 | 0.000 | 0.194 | 0.494 | 0.000 | -0.007 | 1.000 | 0.378 |
| symptom_duration | 0.000 | 0.003 | -0.010 | -0.000 | -0.004 | 0.020 | 0.018 | 0.456 | 0.005 | 0.000 | 0.972 | 0.000 | -0.004 | 0.003 | 0.000 | -0.441 | 0.378 | 1.000 |
Missing values
Sample
| patient_id | age | gender | blood_pressure | heart_rate | glucose_level | cholesterol_level | symptom_duration | clinical_notes | department | admission_type | insurance_status | diagnosis_code | bp_hr_interaction | bp_glucose_ratio | duration_per_hr | pca1 | pca2 | cluster | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | PAT00000 | 80 | Female | 125.9 | 61.3 | 136.3 | 95.7 | 15 | headache and nausea | Orthopedics | Inpatient | Insured | D141 | 7717.67 | 0.916970 | 0.240770 | 0.608099 | -0.863353 | 0 |
| 1 | PAT00001 | 7 | Male | 136.9 | 68.8 | 126.9 | 133.5 | 7 | abdominal discomfort | Orthopedics | Emergency | Insured | D196 | 9418.72 | 1.070367 | 0.100287 | 1.972034 | -1.228151 | 2 |
| 2 | PAT00002 | 34 | Male | 103.7 | 68.0 | 117.8 | 153.3 | 23 | fever and fatigue | General Medicine | Outpatient | Insured | D165 | 7051.60 | 0.872896 | 0.333333 | -0.702154 | -0.744649 | 0 |
| 3 | PAT00003 | 34 | Female | 135.6 | 65.0 | 102.4 | 203.4 | 5 | joint pain | Cardiology | Outpatient | Insured | D196 | 8814.00 | 1.311412 | 0.075758 | 0.586714 | -0.404099 | 3 |
| 4 | PAT00004 | 32 | Female | 151.6 | 78.8 | 141.5 | 251.0 | 12 | chest pain | Orthopedics | Inpatient | Uninsured | D196 | 11946.08 | 1.063860 | 0.150376 | 1.463709 | 1.911526 | 1 |
| 5 | PAT00005 | 4 | Male | 131.5 | 80.1 | 131.5 | 156.4 | 17 | dizziness and confusion | Cardiology | Outpatient | Insured | D196 | 10533.15 | 0.992453 | 0.209618 | 1.586058 | -0.032944 | 1 |
| 6 | PAT00006 | 40 | Female | 131.9 | 70.4 | 138.7 | 219.1 | 16 | headache and nausea | Neurology | Outpatient | Insured | D141 | 9285.76 | 0.944166 | 0.224090 | 0.379202 | 0.866842 | 1 |
| 7 | PAT00007 | 27 | Female | 122.5 | 94.4 | 117.4 | 269.5 | 17 | dizziness and confusion | Neurology | Emergency | Insured | D141 | 11564.00 | 1.034628 | 0.178197 | 0.308008 | 2.217885 | 1 |
| 8 | PAT00008 | 6 | Female | 112.1 | 57.5 | 103.1 | 242.6 | 27 | blurred vision | Neurology | Inpatient | Insured | D185 | 6445.75 | 1.076849 | 0.461538 | -1.812907 | -0.231049 | 1 |
| 9 | PAT00009 | 72 | Male | 131.4 | 75.6 | 90.0 | 163.8 | 7 | fever and fatigue | Emergency | Outpatient | Insured | D141 | 9933.84 | 1.443956 | 0.091384 | 0.596676 | 0.054748 | 3 |
| patient_id | age | gender | blood_pressure | heart_rate | glucose_level | cholesterol_level | symptom_duration | clinical_notes | department | admission_type | insurance_status | diagnosis_code | bp_hr_interaction | bp_glucose_ratio | duration_per_hr | pca1 | pca2 | cluster | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | PAT09990 | 84 | Female | 105.8 | 85.3 | 122.6 | 248.9 | 12 | abdominal discomfort | Cardiology | Emergency | Insured | D165 | 9024.74 | 0.855987 | 0.139050 | -0.726902 | 1.739419 | 3 |
| 9991 | PAT09991 | 31 | Male | 107.9 | 77.4 | 105.2 | 199.6 | 7 | headache and nausea | Cardiology | Outpatient | Insured | D119 | 8351.46 | 1.016008 | 0.089286 | 0.067431 | -0.416618 | 2 |
| 9992 | PAT09992 | 51 | Male | 93.3 | 79.7 | 101.1 | 142.3 | 10 | chest pain | General Medicine | Outpatient | Uninsured | D198 | 7436.01 | 0.913810 | 0.123916 | -0.231204 | -1.016425 | 2 |
| 9993 | PAT09993 | 82 | Male | 123.2 | 73.1 | 105.2 | 139.1 | 4 | abdominal discomfort | Neurology | Inpatient | Insured | D163 | 9005.92 | 1.160075 | 0.053981 | 0.690955 | -0.464461 | 3 |
| 9994 | PAT09994 | 27 | Female | 109.0 | 85.5 | 150.3 | 261.0 | 27 | dizziness and confusion | Cardiology | Outpatient | Uninsured | D119 | 9319.50 | 0.720423 | 0.312139 | -0.542440 | 2.046488 | 1 |
| 9995 | PAT09995 | 84 | Male | 144.9 | 59.0 | 101.4 | 164.8 | 1 | joint pain | Orthopedics | Outpatient | Insured | D163 | 8549.10 | 1.415039 | 0.016667 | 0.752763 | -0.455578 | 3 |
| 9996 | PAT09996 | 63 | Other | 126.1 | 66.0 | 107.3 | 139.4 | 14 | shortness of breath | Cardiology | Outpatient | Insured | D163 | 8322.60 | 1.164358 | 0.208955 | 0.211566 | -0.562600 | 0 |
| 9997 | PAT09997 | 78 | Female | 117.4 | 87.9 | 151.5 | 227.6 | 4 | abdominal discomfort | Orthopedics | Outpatient | Insured | D119 | 10319.46 | 0.769836 | 0.044994 | 0.878700 | 1.642466 | 3 |
| 9998 | PAT09998 | 77 | Female | 110.4 | 70.6 | 98.4 | 174.1 | 27 | abdominal discomfort | General Medicine | Outpatient | Insured | D119 | 7794.24 | 1.110664 | 0.377095 | -1.451846 | 0.484502 | 0 |
| 9999 | PAT09999 | 18 | Male | 97.6 | 53.5 | 143.2 | 98.9 | 28 | dizziness and confusion | Neurology | Inpatient | Uninsured | D196 | 5221.60 | 0.676838 | 0.513761 | -0.740673 | -2.159780 | 2 |